# First Determination of Thermal Resistance and Thermal Capacitance of Atomic-Layer-Deposited In<sub>2</sub>O<sub>3</sub> Transistors

J.-Y. Lin, Z. Zhang, S. Alajlouni, P.-Y. Liao, Z. Lin, C. Niu, A. Shakouri, and P. D. Ye<sup>\*</sup> School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA, \*Email: <u>yep@purdue.edu</u>

Abstract: Electrical and thermal co-design and co-optimization are becoming increasingly important for state-of-the-art monolithic 3D integration. In this work, for the first time, we determine the thermal resistance (R<sub>TH</sub>) and the thermal capacitance (C<sub>TH</sub>) of back-end-of-line (BEOL) compatible atomic-layer-deposited (ALD) ultrathin In<sub>2</sub>O<sub>3</sub> field-effect transistors (FETs) by measuring the steady-state and transient temperatures of active devices using a thermoreflectance (TR) imaging system. From the extracted R<sub>TH</sub> and C<sub>TH</sub>, the heat dissipation capability of In<sub>2</sub>O<sub>3</sub> FETs is found to depend on the geometry of the transistors. An 83% reduction of R<sub>TH</sub> and a 379% increase of C<sub>TH</sub> can be obtained by scaling down the channel length (L<sub>ch</sub>) of In<sub>2</sub>O<sub>3</sub> FETs from 6 µm to 600 nm. This work offers a new methodology to quantitatively study the thermal properties of thin-film transistors along with their electrical performance.

## I. INTRODUCTION

The development of BEOL-compatible devices is important for monolithic 3D integration, a promising route to increase on-chip transistor density and keep semiconductor technology moving forward [1–2]. Recently, ALD In<sub>2</sub>O<sub>3</sub> FETs have stood out due to their low thermal budget under 350 °C [3], good conformality and uniformity on 3D structures [4], high electron mobility over 100 cm<sup>2</sup>/V·s [4], maximum on-current near 20 mA/µm [5], and ultrahigh reliability [6]. One remaining challenge of In<sub>2</sub>O<sub>3</sub> transistors is the self-heating effect (SHE), which makes the devices unstable at high V<sub>DS</sub> biases and thus limits the achievable I<sub>DS</sub> [7–10]. Resolving this issue requires a complete quantitative investigation of the thermal properties of In<sub>2</sub>O<sub>3</sub> FETs.

In this work, the thermal resistance ($R_{TH}$) and thermal capacitance ($C_{TH}$), which together quantify the thermal properties of a transistor, are determined for the first time by measuring steady-state and transient temperatures using a thermoreflectance (TR) imaging system [11]. By comparing devices of different geometries, we find that using a high-thermal-conductivity substrate and scaling down the device size are two crucial ways to improve heat dissipation and reduce the SHE.

## II. EXPERIMENTS AND DEVICE PERFORMANCE

**Fig. 1** illustrates the schematic device structure of top-gated In<sub>2</sub>O<sub>3</sub> transistors on two types of substrates: 90 nm SiO<sub>2</sub> on p+ Si (SiO<sub>2</sub>/Si) and 5 nm HfO<sub>2</sub>, grown by ALD at 200 °C, on highly resistive Si (HfO<sub>2</sub>/HR-Si). The HfO<sub>2</sub> on HR-Si serves as a thermal adhesion layer to improve the thermal interfacial conductance [10]. After solvent cleaning of the substrate, 1.3–2 nm In<sub>2</sub>O<sub>3</sub> was grown by ALD at 225 °C, followed by dry etching for channel isolation. Next, 40 nm Ni was deposited by e-beam evaporation as source/drain contacts. A 7 nm HfO<sub>2</sub> top dielectric layer was then grown by ALD at 120 °C [4]. Finally, 20/30 nm Ni/Au top-gate metal was deposited by e-beam evaporation, followed by O<sub>2</sub> annealing at 250 °C.

The electrical performance of a top-gated (TG) In<sub>2</sub>O<sub>3</sub> transistor with a channel length (L<sub>ch</sub>) of 200 nm and channel thickness (T<sub>ch</sub>) of 1.3 nm on SiO<sub>2</sub>/Si substrate is shown in **Fig. 2**. A drain current (I<sub>D</sub>) of 230 µA/µm is achieved at V<sub>GS</sub> = 4 V and V<sub>DS</sub> = 1 V. Although a higher drain current of 1 mA/µm can be obtained by reducing L<sub>ch</sub> to 40 nm [7], severe SHE then occurs and makes the device unstable. It is therefore necessary to study the thermal properties of the device comprehensively and find a new route to further enhance device performance.

## III. STEADY-STATE THERMO-REFLECTANCE MEASUREMENT

Fig. 3 illustrates the setup of the TR imaging system. During the TR measurement, a periodic pulsed V<sub>DS</sub> signal and light from a green light-emitting diode (LED) are applied to the device under test (DUT). In addition, a synchronized charge-coupled device (CCD) camera captures the surface reflectance signal from the DUT. The working mechanism of the TR imaging system is presented in Fig. 4 [8]. The periodic pulsed V<sub>DS</sub> switches the device ON and OFF. In the ON/OFF state, the DUT heats up/cools down, and the surface reflectance signal is captured as an active/passive image once steady state is reached. The difference in surface reflectivity between the active and passive images is converted into a temperature rise ($\Delta T$) using the calibrated thermal coefficient of the surface gate metal [8-11]. Fig. 5 presents the $\Delta T$ distribution of In<sub>2</sub>O<sub>3</sub> transistors with the same channel width ($W_{ch}$) of 2 µm but different $L_{ch}$ from 2 µm to 600 nm. Interestingly, the $\Delta T$ near the channel center appears to depend on the L<sub>ch</sub> of the transistors. The $\Delta T$ along the channel width was extracted and normalized by the power density (PD), defined as $(I_D \times V_{DS}) / (L_{ch} \times W_{ch})$, as shown in **Fig. 6**. Devices with smaller L<sub>ch</sub> show lower $\Delta T/PD$, which indicates a size-dependent heat dissipation pattern in In<sub>2</sub>O<sub>3</sub> transistors.
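The two conversions described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it takes the usual linear thermoreflectance relation ΔR/R = c_tr·ΔT, and the coefficient `c_tr`, the bias point, and the function names are hypothetical placeholders rather than values or code from this work.

```python
def delta_t_from_reflectance(dr_over_r: float, c_tr: float) -> float:
    """Temperature rise (K) from a relative reflectance change.

    Assumes the linear relation dR/R = c_tr * dT, where c_tr (1/K) is the
    calibrated coefficient of the gate metal (hypothetical value below).
    """
    return dr_over_r / c_tr


def power_density(i_d: float, v_ds: float, l_ch: float, w_ch: float) -> float:
    """PD = (I_D * V_DS) / (L_ch * W_ch); W/m^2 when all inputs are in SI units."""
    return (i_d * v_ds) / (l_ch * w_ch)


# Hypothetical operating point, not measured data.
i_d = 1.0e-3   # drain current, A
v_ds = 2.0     # drain voltage, V
l_ch = 2.0e-6  # channel length, m
w_ch = 2.0e-6  # channel width, m

pd = power_density(i_d, v_ds, l_ch, w_ch)
delta_t = delta_t_from_reflectance(dr_over_r=2.0e-3, c_tr=2.0e-4)
delta_t_per_pd = delta_t / pd  # normalized temperature rise, K*m^2/W

print(f"PD = {pd:.3e} W/m^2, dT = {delta_t:.1f} K, dT/PD = {delta_t_per_pd:.2e} K*m^2/W")
```

Keeping every quantity in SI units means ΔT/PD comes out directly in K·m²·W⁻¹, the unit used for the normalized R<sub>TH</sub>.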

## IV. THERMAL RESISTANCE EXTRACTION

Fig. 7 presents the maximum $\Delta T$ vs. PD relation of In<sub>2</sub>O<sub>3</sub> FETs with the same $W_{ch} = 2$ µm but different $L_{ch}$. The PD is varied by changing $V_{DS}$. The normalized $R_{TH}$ is extracted from the slope of the linear regression of the $\Delta T$ vs. PD data [12] and is plotted as the blue symbols in Fig. 9. An 80% reduction of $R_{TH}$, from 11.5 × 10<sup>-9</sup> to 2.1 × 10<sup>-9</sup> K·m<sup>2</sup>·W<sup>-1</sup>, is observed when $L_{ch}$ is reduced from 8 µm to 600 nm, which means that heat dissipates better in devices with smaller $L_{ch}$. A similar dependence of $R_{TH}$ on device geometry applies to devices with the same $L_{ch} = 2$ µm but different $W_{ch}$, as illustrated in Fig. 8 and Fig. 9. A 65% reduction of $R_{TH}$, from 7.6 × 10<sup>-9</sup> to 2.7 × 10<sup>-9</sup> K·m<sup>2</sup>·W<sup>-1</sup>, with $W_{ch}$ reduced from 8 µm to 600 nm further confirms that scaling the device dimensions improves heat dissipation.
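As a minimal sketch of this extraction, the slope of a least-squares line through the (PD, ΔT) points gives the normalized R<sub>TH</sub>. The data points below are synthetic, generated from an assumed R<sub>TH</sub>, and the helper names are hypothetical; only the endpoint R<sub>TH</sub> values in the percentage check are taken from the text.

```python
import numpy as np


def extract_rth(pd, delta_t):
    """Fit dT = R_TH * PD + offset; return (R_TH in K*m^2/W, offset in K)."""
    slope, intercept = np.polyfit(np.asarray(pd, dtype=float),
                                  np.asarray(delta_t, dtype=float), 1)
    return slope, intercept


def percent_reduction(old: float, new: float) -> float:
    """Relative reduction from old to new, in percent."""
    return 100.0 * (old - new) / old


# Synthetic, noise-free data from an assumed R_TH of 2.1e-9 K*m^2/W.
pd = np.array([1.0e9, 2.0e9, 3.0e9, 4.0e9])  # power density, W/m^2
delta_t = 2.1e-9 * pd                        # temperature rise, K

rth, offset = extract_rth(pd, delta_t)

# R_TH endpoints quoted in the text for L_ch and W_ch scaling.
reduction_lch = percent_reduction(11.5e-9, 2.1e-9)
reduction_wch = percent_reduction(7.6e-9, 2.7e-9)

print(f"R_TH = {rth:.2e} K*m^2/W, offset = {offset:.1e} K")
print(f"L_ch scaling: {reduction_lch:.1f}% | W_ch scaling: {reduction_wch:.1f}%")
```

With the quoted endpoints, the reductions evaluate to about 81.7% and 64.5%, in line with the rounded 80% and 65% stated above.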

**Fig. 10** illustrates the principle of  $L_{ch}$ - and  $W_{ch}$ -dependent heat dissipation in  $In_2O_3$  devices. Joule heat generated in the channel can dissipate through five different paths: 1) vertically to the substrate, 2) laterally to the surrounding substrate, 3) laterally to the source and drain (S/D) metal, 4) vertically through gate dielectric to the gate metal, and 5) laterally through gate dielectric to the surrounding gate metal. When the  $L_{ch}$  decreases, heat dissipation from the channel to the surrounding substrate and S/D through paths 2) and 3) is enhanced, resulting in smaller  $R_{TH}$  for smaller  $L_{ch}$  devices as shown by the blue symbols in **Fig. 9**. Similarly, as the  $W_{ch}$  is scaled down, the heat dissipation via 2) and 5) is strengthened by the shorter distance between the channel and surrounding substrate and gate as illustrated by the red symbols in **Fig. 9**.

## V. THERMAL CAPACITANCE EXTRACTION

In addition to $R_{TH}$, $C_{TH}$ is another important parameter for characterizing the thermal properties of a device. In this work, a new method that combines steady-state and transient TR measurements, as described in **Fig. 11**, is proposed to extract $C_{TH}$. Assuming a simplified 1D thermal circuit model, our In<sub>2</sub>O<sub>3</sub> transistors on the substrate can be modeled by an equivalent thermal resistor and a thermal capacitor. The value of the thermal resistor is the $R_{TH}$ extracted from steady-state TR measurements. The normalized thermal capacitor value, $C_{TH}$, can be calculated by the equation [13]:

$$\tau_{\rm TH} = R_{\rm TH} C_{\rm TH} \tag{1}$$

where  $\tau_{TH}$  is the equivalent thermal time constant of the In<sub>2</sub>O<sub>3</sub> FET and can be extracted from the transient TR measurement.
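A hedged sketch of how (1) might be applied: assuming the one-resistor, one-capacitor model gives a single-exponential heat-up, ΔT(t) = ΔT<sub>st</sub>·(1 − e<sup>−t/τ</sup>), the time constant follows from a straight-line fit of ln(1 − ΔT/ΔT<sub>st</sub>) against t, and C<sub>TH</sub> = τ<sub>TH</sub>/R<sub>TH</sub>. The transient samples, the R<sub>TH</sub> value, and the function names are illustrative assumptions, not data or code from this work.

```python
import numpy as np


def fit_tau_heat(t, delta_t, delta_t_st):
    """Fit tau from heat-up data using ln(1 - dT/dT_st) = -t/tau.

    Uses a least-squares line constrained through the origin. Samples must
    satisfy dT < dT_st so the logarithm is defined.
    """
    t = np.asarray(t, dtype=float)
    y = np.log(1.0 - np.asarray(delta_t, dtype=float) / delta_t_st)
    slope = np.sum(t * y) / np.sum(t * t)
    return -1.0 / slope


def cth_from_tau(tau: float, rth: float) -> float:
    """Equation (1) rearranged: C_TH = tau_TH / R_TH, in J/(K*m^2)."""
    return tau / rth


# Synthetic heat-up transient from an assumed tau of 50 ns (placeholder).
tau_true = 50e-9  # s
delta_t_st = 10.0  # steady-state temperature rise, K
t = np.array([10e-9, 20e-9, 40e-9, 80e-9, 120e-9])  # s
delta_t = delta_t_st * (1.0 - np.exp(-t / tau_true))

tau_fit = fit_tau_heat(t, delta_t, delta_t_st)
rth = 5.0e-9  # hypothetical normalized R_TH, K*m^2/W
cth = cth_from_tau(tau_fit, rth)

print(f"tau = {tau_fit * 1e9:.1f} ns, C_TH = {cth:.2f} J/(K*m^2)")
```

The same fit applies to the cool-down branch with ΔT/ΔT<sub>st</sub> in place of 1 − ΔT/ΔT<sub>st</sub>, again under the single-exponential assumption.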

Fig. 12 illustrates the mechanism of the transient TR imaging system. V<sub>DS</sub> pulses were set to start at 0 ns and end at 500 ns in every V<sub>DS</sub> cycle [8]. Active images were taken starting at t = 0 ns and then at time steps of 10–200 ns to capture how the $\Delta T$ distribution changes with time, as shown in Fig. 13. The maximum $\Delta T$ at each moment for devices with different geometries is plotted in Fig. 14 (a) and Fig. 15 (a). As can be seen, all transient $\Delta T$ results are similar and independent of the device geometry. More specifically, all devices reach steady state at around 200 ns during heat-up and cool down to $\Delta T = 0$ K at around 700 ns. Fig. 14 (b) and Fig. 14 (c) present the time constant ($\tau$) extraction for devices with the same $W_{ch} = 2$ µm but different L<sub>ch</sub> during the heat-up ($\tau_{heat}$) and cool-down ($\tau_{cool}$) processes, respectively. The extracted $\tau$ values are summarized in Fig. 14 (d). Following the same process, the $\tau$ values of the devices with the same $L_{ch} = 2$ µm but different $W_{ch}$ are also extracted and summarized in Fig. 15 (d). Consistent with the transient $\Delta T$ results in Fig. 14 (a) and Fig. 15 (a), the average thermal time constant ($\tau_{TH}$) is independent of the transistor geometry apart from some small variation. Next, using (1) and the R<sub>TH</sub> from Fig. 9, the normalized thermal capacitance ($C_{TH}$) is calculated and presented in Fig. 16. Scaling down either the L<sub>ch</sub> or the W<sub>ch</sub> of the devices increases C<sub>TH</sub>. Considering the physical meaning of C<sub>TH</sub>, namely the amount of energy required to raise the temperature of the transistor by 1 K, a larger C<sub>TH</sub> value implies a more difficult heat-up process and more efficient heat dissipation. In other words, smaller devices have better heat dissipation capability because of their larger $C_{TH}$, which agrees with the $R_{TH}$ results in Section IV. To verify the extracted $C_{TH}$, the theoretical normalized thermal capacitance ($C_{TH,\,theory}$) was calculated by the equation [14]:

$$C_{TH,\,theory} = \frac{M \cdot C_{P}}{L_{ch} \cdot W_{ch}} = \frac{\rho_{sub} \cdot A_{heat} \cdot t_{sub} \cdot C_{P}}{L_{ch} \cdot W_{ch}} \tag{2}$$

where M, A<sub>heat</sub>, and t<sub>sub</sub> (SiO<sub>2</sub>: 90 nm) are the mass, area, and thickness of the substrate that participates in the heat dissipation, respectively. C<sub>P</sub> is the specific heat of the substrate (SiO<sub>2</sub>: 725 J/(kg·K) [15]), and $\rho_{sub}$ is the density of the substrate (SiO<sub>2</sub>: 2210 kg/m<sup>3</sup> [15]). C<sub>TH, theory</sub> is normalized by the area (L<sub>ch</sub>·W<sub>ch</sub>) of the In<sub>2</sub>O<sub>3</sub> FET. Assuming A<sub>heat</sub> = L<sub>ch</sub>·W<sub>ch</sub> = 2 µm·2 µm, the corresponding C<sub>TH, theory</sub> = 0.14 J·K<sup>-1</sup>·m<sup>-2</sup> is far smaller than the experimental C<sub>TH</sub> = 9.51 J·K<sup>-1</sup>·m<sup>-2</sup>. However, if the lateral heat dissipation through paths 2), 3), and 5) is taken into account, so that A<sub>heat</sub> is larger than the device area (L<sub>ch</sub>·W<sub>ch</sub>), then using A<sub>heat</sub> = 16 µm·16 µm gives C<sub>TH, theory</sub> = 9.23 J·K<sup>-1</sup>·m<sup>-2</sup>, close to the experimental C<sub>TH</sub>. This suggests a large portion of lateral heat dissipation, as assumed in **Fig. 10**.
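The two estimates above can be reproduced with a short calculation. The sketch below takes M = ρ<sub>sub</sub>·A<sub>heat</sub>·t<sub>sub</sub> and uses the SiO<sub>2</sub> parameters quoted in the text; the function and variable names are illustrative.

```python
def cth_theory(rho_sub: float, t_sub: float, c_p: float,
               a_heat: float, l_ch: float, w_ch: float) -> float:
    """Normalized theoretical thermal capacitance in J/(K*m^2).

    Heated substrate mass M = rho_sub * a_heat * t_sub, normalized by the
    device area l_ch * w_ch.
    """
    return rho_sub * a_heat * t_sub * c_p / (l_ch * w_ch)


# SiO2 parameters quoted in the text.
RHO_SIO2 = 2210.0  # density, kg/m^3
T_SIO2 = 90e-9     # thickness, m
CP_SIO2 = 725.0    # specific heat, J/(kg*K)

l_ch = w_ch = 2e-6  # device dimensions, m

# Case 1: only the substrate directly under the channel is heated.
c_local = cth_theory(RHO_SIO2, T_SIO2, CP_SIO2, l_ch * w_ch, l_ch, w_ch)

# Case 2: lateral spreading enlarges the heated area to 16 um x 16 um.
c_lateral = cth_theory(RHO_SIO2, T_SIO2, CP_SIO2, 16e-6 * 16e-6, l_ch, w_ch)

print(f"A_heat = device area: {c_local:.2f} J/(K*m^2)")
print(f"A_heat = 16 um x 16 um: {c_lateral:.2f} J/(K*m^2)")
```

Because t<sub>sub</sub>, C<sub>P</sub>, and ρ<sub>sub</sub> are fixed, the normalized result scales linearly with A<sub>heat</sub>/(L<sub>ch</sub>·W<sub>ch</sub>), which is 64 in the second case.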

The R<sub>TH</sub> and C<sub>TH</sub> of the In<sub>2</sub>O<sub>3</sub> FETs on HfO<sub>2</sub>/HR-Si substrate are also extracted. **Fig. 17** shows that the device on HfO<sub>2</sub>/HR-Si substrate has a lower ΔT than the one on SiO<sub>2</sub>/Si substrate at similar PD because of the higher thermal conductivity of the Si substrate [7–10]. The R<sub>TH</sub> and C<sub>TH</sub> on HfO<sub>2</sub>/HR-Si substrate shown in **Fig. 18** likewise indicate that the HfO<sub>2</sub>/HR-Si substrate has better heat dissipation capability. For example, for the device with W<sub>ch</sub> of 2 µm and L<sub>ch</sub> of 6 µm, **Fig. 18** (b) shows that the HfO<sub>2</sub>/HR-Si substrate increases C<sub>TH</sub> by about 73% compared to the SiO<sub>2</sub>/Si one. Scaling the size of the In<sub>2</sub>O<sub>3</sub> transistors is also critical for heat dissipation: by reducing L<sub>ch</sub> from 6 µm to 600 nm, C<sub>TH</sub> is enhanced by 379%, as plotted in **Fig. 18** (b).

## VI. CONCLUSION

In summary, the $R_{TH}$ and $C_{TH}$ of ALD In<sub>2</sub>O<sub>3</sub> transistors are determined for the first time by employing the TR imaging system. Analysis of the extracted $R_{TH}$ and $C_{TH}$ shows that, besides the use of a high-thermal-conductivity substrate, the device geometry is crucial to heat dissipation because of lateral heat flow to the surrounding substrate, S/D, and gate metal. The proposed extraction methods for $R_{TH}$ and $C_{TH}$ provide a way to quantify the thermal properties of thin-film transistors, which is essential for mitigating the SHE in future monolithic 3D integration. The work is supported by AFOSR, SRC nCore IMPACT Center, and DARPA/SRC JUMP ASCENT Center.

## References

[1] M. D. Bishop *et al.*, *IEEE Micro*, vol. 39, no. 6, pp. 16-27, 2019.

[2] S. Datta *et al.*, *IEEE Micro*, vol. 39, no. 6, pp. 8-15, 2019.

[3] M. Si *et al.*, *IEEE TED*, vol. 68, p. 1075, 2021.

[4] M. Si *et al.*, *IEEE TED*, vol. 68, p. 6605, 2021.

[5] Z. Zhang *et al.*, *IEEE EDL*, vol. 43, p. 1905, 2022.

[6] Z. Zhang *et al.*, *VLSI*, T11-3, 2023.

[7] P.-Y. Liao *et al.*, *IEEE TED*, vol. 69, p. 147, 2022.

[8] P.-Y. Liao *et al.*, *IEEE TED*, vol. 69, p. 147, 2022.

[8] P.-Y. Liao *et al.*, *IEEE TED*, vol. 61, p. 3047, 2014.

[12] H. Zhou *et al.*, *ACS Omega*, vol. 2, p. 7723, 2017.

[13] C. Anghel *et al.*, *IEEE EDL*, vol. 25, p. 141, 2004.

[14] M. Wolf, *The Physics of Computing*, Chapter 5, p. 149, 2017.

[15] S. Andersson *et al.*, *J. Phys.: Condens. Matter*, vol. 4, p. 6209, 1992.



Fig. 1. Schematic device structure of top-gated (TG) In<sub>2</sub>O<sub>3</sub> transistors with different substrates.



Fig. 4. Working mechanism of the steady-state TR imaging system [8].



Fig. 6. Cross-sections of the normalized temperature rise ($\Delta T/PD$) along the channel width direction (cutline in Fig. 5) for various L<sub>ch</sub>.



Fig. 10. Schematic cross-sections of In<sub>2</sub>O<sub>3</sub> transistors showing the heat dissipation paths (indicated by arrows) along the (a) L<sub>ch</sub> and (b) W<sub>ch</sub> directions.



Fig. 2. (a) Transfer and (b) output characteristics of a TG In<sub>2</sub>O<sub>3</sub> transistor with channel thickness (T<sub>ch</sub>) of 1.3 nm, channel length (L<sub>ch</sub>) of 200 nm, and channel width (W<sub>ch</sub>) of 2 µm.



Fig. 3. Schematic of the thermoreflectance (TR) imaging system setup.



Fig. 5. (a) CCD image of the top view of an In<sub>2</sub>O<sub>3</sub> transistor. TR images of devices with the same  $W_{ch} = 2 \mu m$ ,  $T_{ch} = 2 nm$ , but different  $L_{ch} = (b) 2 \mu m$  and (c) 600 nm.



Fig. 7. Maximum $\Delta T$ of In<sub>2</sub>O<sub>3</sub> devices with the same $W_{ch} = 2$ µm and $T_{ch} = 2$ nm but different L<sub>ch</sub> at various PD.



Fig. 11. (a) Simplified 1D equivalent thermal circuit model of In<sub>2</sub>O<sub>3</sub> FETs. (b) Extraction method of thermal capacitance (C<sub>TH</sub>).







Fig. 8. Maximum $\Delta T$ of In<sub>2</sub>O<sub>3</sub> devices with the same $L_{ch} = 2$ µm and $T_{ch} = 2$ nm but different W<sub>ch</sub> at various PD.

Fig. 9. Extracted normalized thermal resistance (R<sub>TH</sub>). Blue symbols represent R<sub>TH</sub> at different L<sub>ch</sub> (extracted from Fig. 7). Red symbols represent R<sub>TH</sub> at different W<sub>ch</sub> (extracted from Fig. 8). The results show that heat dissipates more easily to the thick Ni S/D.



Fig. 12. Working mechanism of transient TR imaging system.  $V_{DS}$  pulses were set to start at 0 ns and end at 500 ns in every V<sub>DS</sub> cycle [8].



Fig. 13. Experimental transient TR image measurement of an In<sub>2</sub>O<sub>3</sub> device with $W_{ch} = L_{ch} = 2$ µm and $T_{ch} = 2$ nm. (a–c) show the $\Delta T$ distribution of the channel during the heat-up process. (d–f) show the $\Delta T$ distribution of the channel during the cool-down process.



Fig. 15. (a) Transient  $\Delta T$  results of In<sub>2</sub>O<sub>3</sub> transistors with the same  $L_{ch} = 2 \ \mu m$  and  $T_{ch} = 2 \ nm$  but different  $W_{ch}$  on SiO<sub>2</sub>/Si substrate. Time constant extraction of (b) heat-up and (c) cool-down process. (d) Extracted time constant at different  $W_{ch}$ .



Fig. 17. Steady-state TR images of In<sub>2</sub>O<sub>3</sub> devices with  $W_{ch} = 2 \mu m$ ,  $L_{ch} = 600 nm$ , and  $T_{ch} = 2 nm$  on (a) SiO<sub>2</sub>/Si and (b) HfO<sub>2</sub>/HR-Si substrate. The highest  $\Delta T$  in the devices was decreased from 16.6 to 5.0 K by changing the substrate material.



Fig. 14. (a) Transient $\Delta T$ results of In<sub>2</sub>O<sub>3</sub> transistors with the same $W_{ch} = 2$ µm and $T_{ch} = 2$ nm but different $L_{ch}$ on SiO<sub>2</sub>/Si substrate. The $\Delta T$ of the different devices was adjusted to a similar level by choosing a proper PD. Time constant extraction of the (b) heat-up and (c) cool-down process. $\Delta T_{st}$ represents the steady-state $\Delta T$ in each device. The time (t)/$\tau$ values are calculated by the fitting formulas in (b) and (c). (d) Extracted time constant at different $L_{ch}$.



Fig. 16. Normalized thermal capacitance ( $C_{TH}$ ) extracted using the thermal time constant ( $\tau_{TH}$ ) in Fig. 14 (d) and Fig. 15 (d). Blue symbols represent the  $C_{TH}$  at different L<sub>ch</sub>. Red symbols represent the  $C_{TH}$  at different W<sub>ch</sub>.



Fig. 18. Comparison of thermal parameters, (a)  $R_{TH}$  and (b)  $C_{TH}$ , of devices with the same  $W_{ch} = 2 \ \mu m$  and  $T_{ch} = 2 \ nm$  but different  $L_{ch}$  on SiO<sub>2</sub>/Si and HfO<sub>2</sub>/HR-Si substrates. Since  $\tau_{TH}$  is independent of device geometry,  $\tau_{TH} = 53.5$  ns on HfO<sub>2</sub>/HR-Si substrate was extracted from the In<sub>2</sub>O<sub>3</sub> device with  $W_{ch} = L_{ch} = 2 \ \mu m$  and used to calculate the  $C_{TH}$  of In<sub>2</sub>O<sub>3</sub> FETs on HfO<sub>2</sub>/HR-Si substrate.